Vowel recognition in continuous speech with application of MLP neural network

Authors

  • Elzbieta Smolka
  • Wieslawa Kuniszyk-Józkowiak
  • Mariusz Dzienkowski
  • Waldemar Suszynski
  • Marek Wisniewski
Abstract

The aim of this work was to determine to what extent a multilayer perceptron (MLP) can be applied to automatic vowel recognition in arbitrary fragments of a particular speaker's utterances. Initial research used recordings of the speech of 3 adults. Vowel recognition was performed with a multilayer perceptron. The network input consisted of N-element vectors of sound level values obtained every 0.02 s by spectral analysis. Each network was trained to recognise 6 vowels (a, e, o, u, i, y) plus one pattern covering all other fragments of an utterance, that is, consonants and pauses. Networks that achieved over 90% correct classifications across all time moments were then tested on a completely different data set. The best result in that part of the research was 92% vowel recognition. At the same time, only 50% of the time moments making up these vowels were correctly recognised; the other half were recognised as other vowels or as a different fragment of the utterance. In addition, 15% of the time moments making up consonants or pauses were recognised incorrectly.
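The gap between the vowel-level figure and the time-moment figure in the abstract is easier to see with a small scoring sketch. The Python below is only an illustration under stated assumptions: the plurality-vote rule for deciding that a vowel was recognised, the label name `other`, and the toy label sequences are all hypothetical and do not come from the paper, which does not state its scoring rule here.

```python
from collections import Counter

OTHER = "other"       # our name for the consonant/pause pattern (assumption)
FRAME_STEP_S = 0.02   # one input vector every 0.02 s, per the abstract


def frame_accuracy(true_labels, predicted_labels):
    """Fraction of time moments (frames) whose predicted class is correct."""
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label sequences must have equal length")
    if not true_labels:
        return 0.0
    hits = sum(t == p for t, p in zip(true_labels, predicted_labels))
    return hits / len(true_labels)


def segments(labels):
    """Group consecutive identical labels into (label, start, end) runs."""
    runs = []
    start = 0
    for i in range(1, len(labels) + 1):
        if i == len(labels) or labels[i] != labels[start]:
            runs.append((labels[start], start, i))
            start = i
    return runs


def vowel_recognition_rate(true_labels, predicted_labels):
    """Fraction of vowel segments whose plurality prediction is correct.

    ASSUMPTION: plurality voting over a segment's frames is a scoring rule
    chosen for illustration; the abstract does not say which rule was used.
    """
    vowel_runs = [r for r in segments(true_labels) if r[0] != OTHER]
    if not vowel_runs:
        return 0.0
    recognised = 0
    for label, start, end in vowel_runs:
        votes = Counter(predicted_labels[start:end])
        winner, _ = votes.most_common(1)[0]
        recognised += winner == label
    return recognised / len(vowel_runs)


# Toy example with made-up labels (not data from the paper).
truth = [OTHER] * 2 + ["a"] * 4 + [OTHER] * 2 + ["e"] * 4
preds = [OTHER, OTHER, "a", "a", "o", OTHER,
         OTHER, "a", "e", "e", "e", "y"]

print("duration [s]:", len(truth) * FRAME_STEP_S)
print("frame accuracy:", frame_accuracy(truth, preds))
print("vowel recognition rate:", vowel_recognition_rate(truth, preds))
```

In the toy data both vowels win their plurality vote even though several of their frames are misclassified, so a per-vowel score can be much higher than a per-frame score, which is the kind of discrepancy the abstract reports.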

Related resources

Speech Emotion Recognition Using Scalogram Based Deep Structure

Speech Emotion Recognition (SER) is an important part of speech-based Human-Computer Interface (HCI) applications. Previous SER methods rely on the extraction of features and training an appropriate classifier. However, most of those features can be affected by emotionally irrelevant factors such as gender, speaking styles and environment. Here, an SER method has been proposed based on a concat...

A comparative study of OCON and MLP architectures for phoneme recognition

In this paper, a comparative study of One-Class-One-Network (OCON) and Multi-Layered Perceptron (MLP) neural networks for vowel phoneme recognition is presented. The OCON architecture, first proposed by I. C. Jou et al. (1991), is similar in design to a conventional feed-forward MLP, except that each class has its own dedicated sub-network containing a single output node. Conventional MLPs usually consi...

Detection of vowel onset points in continuous speech using autoassociative neural network models

Detection of vowel onset points (VOPs) is important for spotting subword units in continuous speech. For consonant-vowel (CV) utterances, VOP is the instant at which the consonant part ends and the vowel part begins. Accurate detection of VOPs is important for recognition of CV units in continuous speech. In this paper, we propose an approach for detection of VOPs using autoassociative neural n...

Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods

Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...

Spotting Multilingual Consonant-Vowel Units of Speech Using Neural Network Models

A multilingual speech recognition system is required for tasks that use several languages in one speech recognition application. In this paper, we propose an approach for multilingual speech recognition by spotting consonant-vowel (CV) units. The important features of the spotting approach are that there is no need for automatic segmentation of speech and it is not necessary to use models for higher ...

Journal:
  • Annales UMCS, Informatica

Volume 4

Publication date: 2006